Explore
Featured
Recent
Articles
Topics
Login
Upload
Featured
Recent
Articles
Topics
Login
Upload
Search Results for 'kernel gpu'
kernel gpu published presentations and documents on DocSlides.
GPU Acceleration in ITK v4
by tawny-fly
ITK v4 . . w. inter . meeting. Feb 2. nd. 2011....
Scheduling Techniques for GPU Architectures
by giovanna-bartolotta
Scheduling Techniques for GPU Architectures with ...
GEN : A GPU-Accelerated Elastic Framework for
by briana-ranney
. NFV. Zhilong. . Zheng. . Jun. . Bi. . ...
CS 179: GPU Computing
by faustina-dinatale
Lecture 2: more basics. Recap. Can use GPU to sol...
GPU based ARAP Deformation using Volumetric Lattices
by conchita-marotz
M. Zollhöfer, E. Sert, G. Greiner and J. Süßmu...
CS 179: GPU Computing
by sherrill-nordquist
Lecture 2: more basics. Recap. Can use GPU to sol...
GPU Programming using BU Shared Computing Cluster
by crunchingsubway
Research Computing Services. Boston . University. ...
Scalable Distributed Fast
by molly
Multipole. . Methods. Qi Hu, Nail A. Gumerov, R...
Orchestrating Multiple Data-Parallel Kernels on Multiple De
by faustina-dinatale
Janghaeng Lee. , . Mehrzad. . Samadi. , and Scot...
Sponsors
by debby-jeon
: National Science Foundation, LogicBlox Inc. . ,...
Sponsors
by celsa-spraggs
: National Science Foundation, LogicBlox Inc. . ,...
GPU Computing: Pervasive Massively
by lindy-dunigan
Multithreaded Processors. Michael C Shebanow. Sr....
CS 179 Lecture 13
by alexa-scheidler
Host-Device Data Transfer. 1. Moving data is slow...
Polly-ACC: Transparent Compilation to Heterogeneous Hardwar
by danika-pritchard
Tobias Grosser, . Torsten. . Hoefler. 1. LLVM Wo...
CS 179 Lecture 13
by kittie-lecroy
Host-Device Data Transfer. 1. Moving data is slow...
HSAemu
by lindy-dunigan
- A Full System Emulator for HSA Platform. Prof....
Efficient computation of sum-products on GPUs
by myesha-ticknor
M. . Siberstein. , A. Schuster, D. Geiger, A. . P...
Dissertation Defense
by tawny-fly
Robert . Senser. October 29, 2014. 1. GPU DECLARA...
Hanjin Chu, Director, Heterogeneous solutions, AMD China
by tawny-fly
Heterogeneous System Architecture (HSA) . and the...
CS179: GPU Programming
by yoshiko-marsland
Lecture . 7: Lab 3 Recitation. Today. Miscellaneo...
Fluidic Kernels: Cooperative Execution of
by pasty-toler
OpenCL. Programs on Multiple Heterogeneous Devic...
More Charm++/TAU examples
by yoshiko-marsland
Applications:. NAMD. Parallel Framework for Unstr...
Scalable Fast Multipole Methods on Distributed Heterogeneous Architecture
by lindy-dunigan
Qi Hu, Nail A. Gumerov, Ramani Duraiswami. Inst...
S N Transport on accelerators
by pasty-toler
DOE . CoE. Portability Workshop 4/19/16. Steven ...
GenIDLEST Co-Design Virginia Tech
by liane-varnes
1. AFOSR-BRI Workshop. July 23 2014. Amit . Amrit...
Scalable Multi-Cache Simulation Using GPUs
by tawny-fly
Michael . Moeng. Sangyeun. Cho. Rami. . Melhem....
Portable Performance on Heterogeneous Architectures
by ellena-manuel
Phitchaya. . Mangpo. . Phothilimthana. Jason . ...
Profiling Heterogeneous Multi-GPU Systems to Accelerate Cortically Inspired Learning Algorithms
by tawny-fly
Review student: Fan . Bai. Instructor: Dr. . Sush...
June 24, 2013 Jason Su Technologies for C/C /Fortran
by briana-ranney
Single machine, multi-core. P(OSIX) threads: bare...
Heterogeneous Task Execution Frameworks in Charm++
by camstarmy
Michael Robson. Parallel Programming Lab. Charm Wo...
Lecture 13: Manycore GPU Architectures and Programming, Part 3 -- Streaming, Library and Tuning
by lindsaybiker
Programming, Part 3. -- Streaming, Library and Tun...
CUDA Overview
by everly
Cliff Woolley NVIDIADeveloper Technology GroupGPUC...
Calculation of RI-MP2 Gradient Using Fermi GPUs
by ivy
Jihan Kim. 1. , Alice Koniges. 1. , Berend Smit. 1...
ACCELERATING SPARSE CHOLESKY FACTORIZATION ON GPUs
by ellena-manuel
Dileep Mardham. Introduction. Sparse Direct Solve...
CUDA - 101 Basics Overview
by broadcastworld
What is CUDA?. Data Parallelism. Host-Device model...
VAST: The Illusion of a Large Memory Space for GPUs
by luanne-stotts
Janghaeng Lee. , . Mehrzad. . Samadi. , and . Sc...
Stencil Framework for Portable High Performance Computing
by jane-oiler
Naoya Maruyama. RIKEN Advanced Institute for Comp...
Warped-Slicer:
by alida-meadow
. Efficient Intra-SM Slicing through Dynamic Res...
Automatically Exploiting Implicit
by debby-jeon
Pipeline . Parallelism from . Multiple . Dependen...
Automatic Data Placement Into GPU On-Chip
by test
Memory Resources. ...
Load More...